Toshiba BRIDJE at NTCIR-6 CLIR: The Head/Lead Method and Graded Relevance Feedback

نویسندگان

  • Tetsuya Sakai
  • Makoto Koyama
  • Tatsuya Izuha
  • Akira Kumano
  • Toshihiko Manabe
  • Tomoharu Kokubu
چکیده

At NTCIR-6 CLIR, Toshiba participated in the Monolingual and Bilingual IR tasks covering three topic languages (Japanese, English and Chinese) and one document language (Japanese). For Stage 1 (which is the usual ad hoc task using the new NTCIR6 topics), we submitted two DESCRIPTION runs and two TITLE runs for each topic language. Our first search strategy is Selective Sampling with Memory Resetting, and our second one is the Head/Lead method, which uses the Selective Sampling run as one of the components for data fusion. According to the Relaxed and Rigid Mean Average Precision statistics released by the organisers, we are the top performer in all six subtasks. For Stage 2 (which reused the NTCIR-3, 4 and 5 test collections), we repeated our two Stage 1 strategies in order to enable analysis across all four test collections. Moreover, we conducted some unofficial true relevance feedback experiments by exploiting the graded relevance data provided in the test collections. Our automatic run results show that the Head/Lead method slightly but consistently improves performance, while our unofficial “interactive” run results suggest that graded-relevance metrics favour graded relevance feedback while Average Precision favours binary relevance feedback. In addition, our significance tests suggest that the NTCIR-6 Japanese test collection is “harder” than previous collections.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Toshiba BRIDJE at NTCIR-6 CLIR

At NTCIR-6 CLIR, Toshiba participated in the Monolingual and Bilingual IR tasks covering three topic languages (Japanese, English and Chinese) and one document language (Japanese). For Stage 1 (which is the usual ad hoc task using the new NTCIR6 topics), we submitted two DESCRIPTION runs and two TITLE runs for each topic language. Our first search strategy is Selective Sampling with Memory Rese...

متن کامل

Toshiba BRIDJE at NTCIR-4 CLIR: Monolingual/Bilingual IR and Flexible Feedback

Toshiba participated in the Monolingual/Bilingual tasks at NTCIR-4 CLIR using our CLIR system called BRIDJE. We submitted 24 runs covering three topic languages (Japanese, English and Chinese) and two document languages (Japanese and English) and achieved the highest performances in the E-J-D, CJ-D, C-J-T, E-E-D, J-E-D, J-E-T subtasks. We had 12 more runs which we were not allowed to submit due...

متن کامل

Toshiba BRIDJE at NTCIR-5 CLIR: Evaluation using Geometric Means

Toshiba participated in the Monolingual and Bilingual IR tasks at NTCIR-5 CLIR using the BRIDJE system. We submitted 24 runs covering three topic languages (Japanese, English and Chinese) and two document languages (Japanese and English), and achieved the highest performances in the E-J-T, EJ-D, C-J-T, C-J-D, J-E-T and J-E-D subtasks. This paper (re-)examines Partial Disambiguation and the Pivo...

متن کامل

Toshiba BRIDJE at NTCIR-5 CLIR

Toshiba participated in the Monolingual and Bilingual IR tasks at NTCIR-5 CLIR using the BRIDJE system. We submitted 24 runs covering three topic languages (Japanese, English and Chinese) and two document languages (Japanese and English), and achieved the highest performances in the E-J-T, EJ-D, C-J-T, C-J-D, J-E-T and J-E-D subtasks. This paper (re-)examines Partial Disambiguation and the Pivo...

متن کامل

ISCAS in CLIR at NTCIR-6: Experiments with MT and PRF

We participated in the English-Chinese cross-language information retrieval (CLIR) E-C tasks in NTCIR6. Considering the special feature of crossing two different languages in CLIR, our main concerns in our experiment are 1) to evaluate the appropriateness of MT as a means of query translation in CLIR, 2) to evaluate the effect of feedback in retrieval model to the performance of CLIR which has ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007